Optimizing the NAS Parallel BT Application for the POWER CHALLENGEarray
نویسندگان
چکیده
The POWER CHALLENGEarray is a coarse-grained collection of large processor SMP nodes. This creates interesting parallelization opportunities for scalable applications. The NAS BT benchmark is a classical ADI-like application with non-trivial communication requirements. The coarse-grained distributed feature of the POWER CHALLENGEarray provides unique parallelization strategies. We explore the implementation of this benchmark on this machine and discuss the general implications for scalable application development
منابع مشابه
Early experiences with OpenMP on the Origin 2000
OpenMP has been marketed as THE emerging standard for shared memory parallelism (SMP). The rst compiler for OpenMP is now available on the Cray Origin 2000. In this paper we report on some early experiences with this compiler on a (quasi-)application code, an implementation of the NAS, BT benchmark. OpenMP includes, of course, the traditional do-loop parallelization. For programmers familiar wi...
متن کاملUnderlying Constructs of Farmers’ Perceptions towards Bt Cotton Among Former Cotton Farmers in Northern Ghana: Empirical Application of Q Methodology
It is often argued that learning from best examples in the neighbouring Burkina Faso and elsewhere, Ghana can succeed in revamping the collapsing cotton industry by introducing Bt cotton to farmers. This paper therefore presents a survey findings on farmers’ views and perceptions towards the possible introduction of Bt cotton. A stratified random sampling techniques was applied in selecting 254...
متن کاملPerformance Characteristics of Hybrid MPI/OpenMP Implementations of NAS Parallel Benchmarks SP and BT on Large-Scale Multicore Clusters
The NAS Parallel Benchmarks (NPB) are well-known applications with the fixed algorithms for evaluating parallel systems and tools. Multicore clusters provide a natural programming paradigm for hybrid programs, whereby OpenMP can be used with the data sharing with the multicores that comprise a node and MPI can be used with the communication between nodes. In this paper, we use SP and BT benchma...
متن کاملPerformance Coupling: Case Studies for Measuring the Interactions of Kernels in Modern Applications
Traditional performance optimization techniques have focused on nding the kernel in an application that is the most time consuming and attempting to optimize it. In this paper we focus on optimization techniques for a more global perspective of the application. In particular, we present a methodolodgy for measuring the interaction or coupling between kernels within an application and describe h...
متن کاملGeneCrunch and Europort
The SGI POWER CHALLENGEarray TM represents a hierarchical supercomputer because it combines distributed and shared memory technology. We present two projects, Europort and GeneCrunch, that took advantage of such a configuration. In Europort we performed scalability demonstrations up to 64 processors with applications relevant to the chemical and pharmaceutical industries. GeneCrunch, a project ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008